Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise
نویسنده
چکیده
This paper describes an efficient method of improving the noise-robustness of speech recognition in a noisy car environment by considering the acoustic features of a car's interior noise. We analyzed the relationship between the Articulation Index values and the recognition rates in car environments under different driving conditions. We clarified that the recognition rate significantly worsens when the engine noise (periodic sound) components in the frequency range above 200 Hz were large. We developed a preprocessing method to improve the noiserobustness despite large amounts of engine noise. With this method, the cutoff frequency of the front-end high-pass filter is adaptively changed from 200 through 400 Hz according to the level of the engine noise components. The use of this method improved the average recognition rate for all eight cars under the second range acceleration condition by 11.9%, with the recognition rate for one of the cars being improved considerably by 38.6%.
منابع مشابه
Auditory-based Acoustic Distinctive Features and Spectral Cues for Robust Automatic Speech Recognition in Low-SNR Car Environments
In this paper, a multi-stream paradigm is proposed to improve the performance of automatic speech recognition (ASR) systems in the presence of highly interfering car noise. It was found that combining the classical MFCCs with some auditory-based acoustic distinctive cues and the main formant frequencies of a speech signal using a multi-stream paradigm leads to an improvement in the recognition ...
متن کاملComparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for robust automatic speech recognition in low-SNR car environments
This paper presents an evaluation of the use of some auditorybased distinctive features and formant cues for robust automatic speech recognition (ASR) in the presence of highly interfering car noise. Comparative experiments have indicated that combining the classical MFCCs with some auditory-based acoustic distinctive cues and either the main formant magnitudes or the formant frequencies of a s...
متن کاملSpeech recognition in noisy car environment based on OSALPC representation and robust similarity measuring techniques
The performance of the existing speech recognition systems degrades rapidly in the presence of background noise. The OSALPC (One-sided Autocorrelation Linear Predictive Coding) representation of the speech signal has shown to be attractive for speech recognition because of its simplicity and its high recognition performance with respect to the standard LPC in severe conditions of additive white...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملCzech language database of car speech and environmental noise
This paper will present new Czech language twochannel (stereo) speech database recorded in car environment. The created database was designed for experiments with speech enhancement for communication purposes and for the study and the design of a robust speech recognition systems. It respects car noise environment which is currently at the top of the interest. Tools for automated phoneme labell...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004